An empirical comparison and characterisation of nine popular clustering methods

نویسندگان

چکیده

Nine popular clustering methods are applied to 42 real data sets. The aim is give a detailed characterisation of the by means several cluster validation indexes that measure various individual aspects resulting clusters such as small within-cluster distances, separation clusters, closeness Gaussian distribution etc. introduced in Hennig (in: Data analysis and applications 1: regression, modeling—estimating, forecasting mining, ISTE Ltd., London, 2019). 30 sets come with “true” clustering. On these similarity clusterings from nine explored. Furthermore, mixed effects regression relates observable clusterings, which problems unobservable. study gives new insight not only into ability discover but also properties can be expected methods, crucial for choice method situation without given

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

on the comparison of keyword and semantic-context methods of learning new vocabulary meaning

the rationale behind the present study is that particular learning strategies produce more effective results when applied together. the present study tried to investigate the efficiency of the semantic-context strategy alone with a technique called, keyword method. to clarify the point, the current study seeked to find answer to the following question: are the keyword and semantic-context metho...

15 صفحه اول

Popular Ensemble Methods: An Empirical Study

An ensemble consists of a set of individually trained classifiers (such as neural networks or decision trees) whose predictions are combined when classifying novel instances. Previous research has shown that an ensemble is often more accurate than any of the single classifiers in the ensemble. Bagging (Breiman, 1996c) and Boosting (Freund & Schapire, 1996; Schapire, 1990) are two relatively new...

متن کامل

investigation of single-user and multi-user detection methods in mc-cdma systems and comparison of their performances

در این پایان نامه به بررسی روش های آشکارسازی در سیستم های mc-cdma می پردازیم. با توجه به ماهیت آشکارسازی در این سیستم ها، تکنیک های آشکارسازی را می توان به دو دسته ی اصلی تقسیم نمود: آشکارسازی سیگنال ارسالی یک کاربر مطلوب بدون در نظر گرفتن اطلاعاتی در مورد سایر کاربران تداخل کننده که از آن ها به عنوان آشکارساز های تک کاربره یاد می شود و همچنین آشکارسازی سیگنال ارسالی همه ی کاربران فعال موجود در...

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...

متن کامل

An empirical comparison of three inference methods

In this paper, an empirical evaluation of three infer­ ence methods for uncertain reasoning is presented in the context of Pathfinder, a large expert system for the diagnosis of lymph node pathology. The inference procedures evaluated are (1) Bayes' theorem, a:ssum­ ing evidence is conditionally independent given each hypothesis, (2) odds-likelihood updating, assuming evidence is conditionally ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Advances in data analysis and classification

سال: 2022

ISSN: ['1862-5355', '1862-5347']

DOI: https://doi.org/10.1007/s11634-021-00478-z